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ON AN UNSYMMETRICAL PROBABILITY CURVE. 



BY E. L. DE FOKEST, WATERTOWN, CONN. 

When repeated observations of a quantity are made, and are liable to 
error through accidental or unknown causes, it is sometimes found that the 
facility of error is greater on one side of the arithmetical mean than on the 
other, so that the limits of possible error are different, and + and — errors 
of equal amount do not occur with equal frequency. Cases of this kind 
have been noticed by various writers; see for instance Quetelet's Lettres sur 
la Theorie des Probabilite s, pp. 180 and 410. He remarked a similarity 
between the apparent form of the curve of facility, and that of the series of 
terms in the expansion of a binomial p-\-q to a high but finite power, when 
p and q are very unequal. I am not aware that any writer has attempted 
to give the analytical equation of such a curve, and it is the object of the 
present paper to obtain one. We shall do this, however, in the most gen- 
eral way, regarding the desired curve of facility as a limiting form of the 
series of coefficients in the expansion of a polynomial, which may or may 
not be a binomial. The limit of a polynomial having none but positive co- 
efficients has been investigated in a peculiar manner, by Laplace and subse- 
quent writers, and found to be the common probability curve. See, for 
instance, Meyer, WahrscheinlichJceitsrechnung, Leipsic 1879, pp. 141, 350, 
407, 442.* A simpler and very different way of obtaining such results was 
given by me in the Analyst, Sept. 1879 and Sept. 1881, and I have ex- 
tended it to the cases of polynomials of two and three variables. Although 
the general form of the ultimate limiting curve for a polynomial of one vari- 
able is represented by the probability curve 

yet it can be shown that the actual form of an expansion to a high power 
approaches still more closely to another and more complex curve, of which 
(1) is only a special case. To discover the nature of this curve, we must 
take precautions to insure the most accurate approximation. 

First, we shall use the method of symmetrical differences. Having an 
unlimited series of equidistant terms 

.... (j_ 2 , I'i-i, H, k+\> H+2) .... (2) 

we take their differences so as to keep l t always in the middle. The differ- 
ences of even order are 

*Meyer's work is a valuable compendium of the science of probability, brought down to 
recent date. 
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zf 2 = —%+k+i+k-!, J 4 = 6Z { — 4(Z j+1 4-4-l)+4+2+4-2, 

Those of odd order would in the usual form be 

A ~ 4+K 4-J0 ^3 == 4+lJ£ 3/ <+ ^ + 3^_^ li_ v/% ,&C. 

But since only the terms in (2) are supposed to be given, we make the 
hypotheses 

4-k — 1(4-1+4), h+y, = i(h+k+i)> 4+ik = i(4+i + 4+2)5 Ac., 
and by substitution get expressions which we agree to represent by J lf J 3 , 
&c, thus 

^1 = 2 (4+i — 4-i), ^3 = iPi+2 — 4-2 — 2(4+ 1 — 4-i)]» \ /Q\ 

4 5 = IC4+3 - 4-8 - 4 (4+2 - 4- 2 )H- 5(4+1 - 4-l)], &C. I W 

It will ))e found that each difference of odd order thus formed is half the 
sum of the two nearest differences of the same order formed in the usual 
way, one on each side of the centre or place of l { . Thus for example 

^3 = i[(4 +2 — 34+i +34 - 4_i ) + (4+i — 34+ 3ft_! — 4- 2 )]- 

From the expressions for J lf J 2 , &c. we get by successive eliminations 
fctl = k±4 1 +y 2 , &±2 = l i ±2J 1 +2J 2 ±J 3 +y i , 
and in general 

4+» - 4 +1 J i+0 J2+ T^3~ Z,3+ T2lf Z,4+ 1A3A5~ 6 

+ &c, (4) 
where w may be either positive or negative. This formula, which may be 
used for making interpolations, is given by Lacroix, Calcul Diff. et Int., 
Paris 1819, Vol. III. pp. 26-28. 

[t was shown in my Analyst articles above cited, that if any polynomial 

X-. m Z~ m + .... +<*_ 1 2- 1 +^„+; i 2+ .... +hfi m , (f>) 

whose coefficients X may be either -f or — , is expanded to the h power, and 
the expansion is written 

L km z-* m +... +L 1 s- 1 +? + ^ 1 2+ . . . +4»«*-, (6) 

then any coefficient l t in the expansion, and the 2m coefficients nearest to it, 
will be connected by the relation 

(/■■jlj—1 /_x4+l)~rA"24— 2 — *— 24+2/~T • •' ~T m \Am'i—m "— m' i+»)_ % _ m\ 

^o4 + (^l4-l +^-l4+l)+(^24-2+^-24+2 + • • • +(^m4-M+^-m4+m) & + 1 

Let l i+1 , 4-i Ac, be expressed in terms of l t and the differences as in (4), 
and write also 

&o = Jo + ^i+J-iHC^+^-aH • — +&.+*—) 1 
b x = l(^—k_ y )+2(k 2 -X_ 2 )+....+ma m —X_ m ) | 

6 2 = ^ ] +i- 1 )+2 2 (i 2 +i_ 2 )+....+m 2 (i+U )■ (8) 

6 3 = l^ 1 _A_ 1 )+2 3 (A 2 -A_ 2 )+....+m 3 (; m -A_ M ) 

&c. &c. J 
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Denoting the numerator and denominator in the first member of (7) by 
iV"and D, and writing 1.2.3 . . . n = n !, we get 

N=b 1 l i -b 2 A 1 +lb 3 A 2 -^(b i -b 2 )A 3 +l ] (b 5 ~b 3 )A i -^ 



5 



X(6 6 -56 4 +46 2 )z* 
+1 (6 7 -56 5 +4b 3 ) A 6 - l ] (b 8 -8b 6 +19b i -12b 2 )J,+&c. 

X(6 5 -56 8 +46 1 )j" 
+^j (6 6 -56 4 +46 2 M 6 - 1 (6,-865+1963-126!)//, +&o, 



s 



(9) 



D i+1 

It will be seen that in the coefficient of A 2m+1 or ^ 2m+2 , the numerical 
coefficients of the 6 within the parentheses are those of the powers of n in 
the product of the factors 

(n» — l 2 ) (w 2 — 2 2 ) (?i 2 — m 2 ). 

When k becomes an infinity of the second order, that is, of a magnitude 
comparable with the quotient of a finite area by (dxf, and the successive 
values of I are regarded as consecutive ordinates y to a limiting curve which 
extends to an infinite distance over the axis of X, we have 

k — y, *i = dy, A 2 = d 2 y, A 3 = dSj, &c. 

The common interval Ax between ordinates becomes dx when they are set 
close together, and the abscissa corresponding to any y is x = idx. Thus 
(9) becomes the differential equation of the curve, and 6 , b lf &c. are con- 
stants. Any given polynomial may be reduced to one in which 2(X) = 1, 
by dividing it throughout by the sum of its coefficients. We have then 
6 = 1. If a constant number is added to or subtracted from all the expo- 
nents of z in (5), it will not alter the values of I in (6). Hence, as shown in 
my former article, b x may be reduced to zero by transferring the origin or 
place of 2 to the centre of parallel forces of the coefficients 1 in (5), when 
these are regarded as forces ranged equidistantly along the imponderable 
axis of X and acting upon it at right angles in the plane XY. Then any 
constant b n in (8) will denote the sum of the products formed by multiply- 
ing each X into the nth power of its abscissa reckoned from the new origin, 
if the common interval Ax between the coefficients or forces X is regarded 
as unity. But if any other unit of abscissas is employed, the sum of the 
products will be b n (Ax) n , or b n (dx)' 1 when the coefficients are set close togeth- 
er so as to be consecutive. We may now write (9) as follows. 
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b 2 dy— i& 8 dfy+K&4 


—b 2 )dhj- 


-&c 


X 


y+¥> 2 d\j— ^b s 


d 3 y+&c. 




{k+l)dx 



(10) 

In the numerator of the first member let d*y, d*y &e. be neglected in 
comparison with dy and d % y, and in the denominator neglect dhj, d 3 y &c. 
in comparison with y. Since h is infinitely large, we may write h instead 
offt + 1- Therefore 

dy—\{b z -*-b 2 )d 2 y _ —x ^ 
y kb 2 dx 

Invert both members of this equation, subtract J(6 s -*-& 2 ) from each, and 
invert them both back again. This gives 

dy—l{b z ^-b 2 )d 2 y _ — x 



y-W*+b 2 )dy + %b a -*-b a )*d*y kb 2 dx + Hb a +b a )x (U) 
In the denominator of the first member, let dhj be neglected in compari- 
son with y and dy. From the properties of the lever arms of the coeffic'nts 
in polynomials and their products, as shown by me in the Analyst arti- 
cles cited (see also March and Nov. 1880), it follows that since the origin 
or place of s° in the given polynomial (5) is transferred to the centre of 
forces for the coefficients A, the origin or place of z° in the expansion to the 
h power will be located at the centre of forces for the coefficients I in (6), 
which become the ordinates y to the limiting curve. 

When the A's are all positive, as they must be if they represent probabil- 
ities, the y's will be all positive, and their centre of forces is the same as 
their centre of gravity, if the ordinates are regarded as the masses of mate- 
rial points ranged along the X axis at intervals equal to dx. Now in (11) 
let the origin be transferred from the centre of gravity to another conveni- 
ent point by putting 



x — 



W>% dx 



2' 



6 a- (12) 

in place of x. This gives 

dy—l(b z ^-b 2 )d 2 y 4kb 2 dx—2(b z +b 2 )x . „ 

In the first member, the numerator is the differential of the denominator. 
Without any further change of origin, we can write approximately 

y+Ub s -i-b 2 )dy and x+^{b 3 ~b 2 )dx 

in place of y and x respectively, neglecting d 3 y in the numerator and d 2 y in 
the denominator, and so get 

dy _ 4hb 2 dx — (b 3 ^-b 2 ) 2 dx — 2(b s ~b 2 )x 

J {b^b~;Yix+^b s -^)dx2~ * 

Since the denominator y in the first member is supposed to be infinitely 
greater than the numerator dy, the denominator in the second member must 
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be infinitely greater than its numerator, so that in the denominator we may 
neglect dx in comparison with x. Also let the constants be expressed by 
means of two new constants 

a = w> b = hb ^ ? - (14) 

Since h is supposed to be an infinity of the second order, b represents a 
finite area. The equation will now stand 

$L = — (a*b — 1) — adx, (15) 

y x 

and integration gives 

log'y = (a 2 6 — 1) log's; — ax-\-\og'G, 

.'.y = Ox aib - 1 e- ax . (16) 

This is a more accurate relation between y and x than we could have 

obtained without the use of symmetrical differences. In (15) dy is not the 

difference of the ordinates at a; and x-\-dx, but of those at a; — %dx and x 

+%dx. 

The finite difference of log'y is usually considered to be 

J log'y = \og'(y+Jy)— log'y = hg'(l+~^j ] 

=^(M* +i m s - &c . \ (17) 

y \y I \y ) J 

But we have taken it to be 

d log'y - log'(y+|%)-log'(y-|Jy) = log'(l+|j)-log'(l-|p ^ 

=^uf)+m^ . i (18> 

These expressions show that when dy-r-y is regarded as the differential of 
log'y, the magnitude of the error committed is either 

according as we consider dy to be y i+1 — y< or y i+ii — yi-%- The second error 
is of a lower order of magnitude than the first. Conversely, if we take 
log'y to be the integral of dy-i-y, the accuracy of the result is increased when 
dy or Ay is understood in the sense here adopted. 
When we write 

ax = v, a 2 b = n, (20) 

the ordinate y in (16) is seen to be proportional to an expression having the 
known form 
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the element-function of the Gamma integral. The limiting form (16) of 
the expansion of a polynomial, therefore, is a curve shaped like the one 
whose area, between x = and x = oo, is r\n). For convenience, we may 
call it the gamma curve. It makes y = for x — and for x — oo. (See 
Price's Calculus, I. p. 208.) We also have 

^L = Cx a "- b ~ 2 e- a *(a 2 &— 1— ax), ~] 

j* dX ' \ (21) 

|_| = C:c a2J - 3 e-'- [(aa;) 2 — 2(a 2 6— l)aa+(a 2 6--l) (a 2 6-2)], j 

whence it appears that the X axis is tangent to the curve at x = and 
asymptote to it at a; = oo, and that y is a maximum at 

x = ab ~-. (22) 

a 

There are two points of inflexion, equidistant from the maximum, at 

x = ab — I ± iv'(a 2 & — 1). (23) 

Since the first n — 2 differential coefficients will contain both x and e - * as 
factors, it appears that the X axis has a contact of an order as high as the 
n — 2, with the curve at x = and x = cx>. Hence the curve anywhere 
near those points is almost a straight line coinciding with the axis, and val- 
ues of y in the vicinity of either will scarcely differ from zero. This agrees 
with what we know of the extreme smallness of the coefficients at or near 
the two ends of the exjiansion of a binomial or polynomial to a high power. 

Since 2(X) = 1 in the given polynomial and 1(1) — 1 in its expansion, 
we shall have 2(y) = 1 in the curve (16), so that 

which gives the value of C, and we get 

the complete equation of the curve sought. According to the view stated 
in my former articles, by which the limiting form of the expansion of a 
polynomial expresses the most plausible law of facility of error, y will rep- 
resent the probability that any error which occurs will fall within an arbi- 
trary but very small interval dx, the abscissa of whose middle point is x. 
The constants a and b depend on the nature of the observations. The 
quantity b 2 (dxf is the square of what I have called the "radius of gyration" 
of the coefficients A in the polynomial (5), about their centre of gravity ; but 
it might better, perhaps, be called the quadratic radius. 



c +c l2 +o 2 s 2 + ....+c„ 2 ". r ( 26 ) 
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I have shown (Analyst, Jan. and May 1880), that the squared radius 
for the expansion to the h power is h times as great as for the first power. 
It follows that b in (14) is the square of the quadratic mean error, that is, 
the mean of the squares of the deviations of the observations from their 
arithmetical mean. The expression for a in (14) has for its denominator 
the quantity b 3 (dx) 3 , which is of the same nature as b 2 (dx) 2 , except that the 
cubes of the deviations are used instead of the squares, deviations on one 
side being regarded as -f and on the other side as — . We will now prove 
by a precisely analogous method, that this quantity also is h times greater 
in the expansion to the h power, than it is in the first power. 

In my article last cited, any two polynomial factors were denoted by 

a J r a 1 z-\-a 2 z 2j r -\-a m z" 

c o+ c i 2 +°2 j2 + +c„ 

The coefficients a und c may be essentially either + or — . Their sums 
were 

#i = «o+ a i+ + a m, 8 2 = c -\-c 1 + -fc n , (27) 

and h-y and h 2 were the lever arms of the coefficients about the place of the 
first terms, so that 

Sjhy = (<x 1 +2a 2 + . . . +ma m )Jx, 8 2 h 2 —(c 1 - J r 2c 2 + . . . +no n )dx. (28) 

It was proved that the lever arm of the coefficients in the product of the 
two factors, about the place of the first term in the product, is 

H = h t + h 2 . (29) 

The quadratic radii of the coefficients in the two factors, about the places 
of the first terms, were denoted by g x and g 2 , whence 

8 ig \ = (l 2 a x +2 2 a 2 + . . . +m 2 a ro )(zfe) 2 , \ . . 

8 2 g* = (l 2 Cl +2 2 c 2 + . . . + n*c n ){Axf, / ^ 

and putting for the like radius in the product, it was proved that 

G 2 =9\ + 9\ + 2/>A- (31) 

Now let u\ and u\ be formed from the two factors in the same way as 
b s (dxf was. 8-yuf and 8 2 u\ will represent the two sums of all the products 
obtained by multiplying each coefficient by the cube of its + or — devia- 
tion from the centre of forces, so that 

8 x u\ = aoi—hy) 5 +a 1 (Jx—h 1 ) 3 + a 2 (2Jx-h 1 ) z ... +a m (mJx-h l f, \ ,„ 9 \ 
8 2 u\ = c (-h 2 )3+c 1 (Jx-h 2 )s + c 2 (2Jx-h 2 )s...+e n (nJx-h 2 Y.{^> 

For convenience we will call such quantities as u x and u 2 cubic radii 
about the point from which the deviations are reckoned. Let l x and l 2 de- 
note the cubic radii taken about the first term in each polynomial, then 

8 x lf = (13 ai+ 23a 2 + . . . +m«a w )(Jaj)», 1 ,„,* 

8 2 l 2 * = (l»o 1 + 2»o,+ ...+n»o ll )(A ! )8./ ^ 
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From (32) we have 
S t uf = (l»o 1 +2»o 2 + • • +m?a m )(4x) 3 -~3h 1 (l 2 a 1 +2'a 2 + .. +m 2 a m )(Jx) 2 

-\-3h 2 1 (a l +2a 2 + . . +ma m )Ax—h\(a a -\-a 1 + . . +a m ), 
and a similar expression for $ 2 wf . Then by help of (27), (28), (30) and 
and (33), we get 

u\ = l 1 i -Sgfh i +2hl u§ = l 2 * — 8g*h a +2h*, (34) 

Likewise denoting by U and L the cubic radii for the product, about its 
centre of forces and its first term respectively, we shall have 

Z7 3 = L 3 — SG 2 H + 2H 3 . (35) 

The sum of all the coefficients in the product is $ x # 2 . Supposing the 
first polynomial factor to be multiplied successively by the terms of the sec- 
ond one, L 3 will be expressed thus, 

8&L* = o (Pa 1 +2 3 a 2 + . . . +m 3 a m ) {Ax) 3 

+ c 1 [l 3 o +2 3 a l + . . . +(m+l) 3 <] {Ax) 3 
+ c 2 [2'a +3 3 a 1 + . . . +(m+2fa m ] W 
+ ... +c nL > 3 a +(n+l) ! a 1 + . . . +(n+m) 3 aj {Ax) 3 . 
The coefficient of o„( Ax) 3 is reducible by means of (27), (28), (30) and 
(33) to 

Assigning to n the values 0, 1, 2 &c. in succession, we get expressions 
for the coefficients of c {Ax) 3 , c^Ax) 3 , &c, and so find 

8&L* = o flf 1 ^ 1 , +o 1 /8 , 1 [lV*) , +3.1 , Al(^) , +3.1fl'?^+^ 1 •] 
-fo 2 S 1 [2 3 (Ja;)H3.2 i, A 1 (Ja;) 2 +3.2^f Ax+l^] 
+ ... +c n S 1 [n*(Jx)*+3<rv i h 1 (dxy+3ngl Ax+lj*], 
which we put in the form 

S a L* = i 1 1 (o +o 1 + . • • +o n )+SglAx{o 1 +2o 2 + . . . +ne n ) 

+ Sh 1 (Jx)\l i o 1 +2\ +... +n\)+{Ax) 3 {l 3 c 1 +2 3 c a + . . . 

. . + n 3 c n ). 
By help of (27), (28), (30) and (33) this is reduced to 

L 3 = q + 1% + WiK + Wi- (36) 

Substituting in (35) the values of H, G and L from (29), (31) and (36), 

V ' G gC ZJ3 = lf+q -3( fl r?A 1 +^A a )+2(Af+AS), 

and by (34) we have finally 

ZJ3 = uf + u|. (37) 

[To be continued.] 



